Evaluating two freely available geocoding tools for geographical inconsistencies and geocoding errors
نویسنده
چکیده
Background: Geocoding is highly prone to error for various reasons. This paper examines the geographical inconsistencies associated with geocoding errors seen when using two freely available geocoding tools, Google Sheets and ggmap. Methods: Two hundred restaurants, all recipients of California’s Center of Excellence award, were selected for the analysis. The geocoded addresses were plotted on maps using QGIS, Google Maps, OpenStreetMap (OSM), and Google Earth for visualization, comparison, and validation. A stepwise method of analyzing the geographical inconsistencies is provided that can be adapted for any locational analytics. Results and discussion: Both Google Sheets and ggmap were able to successfully geocode all 200 addresses, but ggmap incorrectly geocoded eight addresses as being more than 2,000 miles from their actual location. Addresses containing the ampersand character, &, caused ggmap to incorrectly geocode their location. After replacing the ampersand with the word and, ggmap was able to correctly geocode those addresses. The corrected locations plotted on Google Maps and OSM were similar, and they exactly matched the actual locations when plotted on Google Earth. Conclusions: Both Google Sheets and ggmap are equally capable of geocoding physical locations, but R users are advised that addresses for geocoding must be free of the ampersand character if correct results are to be obtained. In addition, geocoded outputs should be plotted on a map using QGIS, ArcGIS, Google Maps, OSM, R, or any other such mapping tools for visualization and validation. This will ensure a high-quality geospatial analysis of places or events when locational information is vital for decision-making.
منابع مشابه
An Approach to gecoding based on volunteered Spatial Data
The automated process of assigning geographic coordinates to textual descriptions of a place, generally referred to as geocoding, plays an important role in various fields of geographic information technologies, ranging from the analysis of healths records [28] or crime incidents [17] to location based services like route planning applications [20]. However, since the collection and maintenance...
متن کاملModeling the probability distribution of positional errors incurred by residential address geocoding
BACKGROUND The assignment of a point-level geocode to subjects' residences is an important data assimilation component of many geographic public health studies. Often, these assignments are made by a method known as automated geocoding, which attempts to match each subject's address to an address-ranged street segment georeferenced within a streetline database and then interpolate the position ...
متن کاملLeading the charge for better batteries.
Background: The assignment of a point-level geocode to subjects' residences is an important data assimilation component of many geographic public health studies. Often, these assignments are made by a method known as automated geocoding, which attempts to match each subject's address to an address-ranged street segment georeferenced within a streetline database and then interpolate the position...
متن کاملOn the Accuracy of Online Geocoders
Geocoding is the conversion of a textual description of a location to geographic coordinates. With online geocoders being freely available to researchers and practitioners alike, their in uence on data quality needs to be estimated. To this end, we rst describe the basics of address-based geocoding and accuracy issues. We then look at two of the most widely-used geocoders and provide analysis o...
متن کاملA research agenda: does geocoding positional error matter in health GIS studies?
Until recently, little attention has been paid to geocoding positional accuracy and its impacts on accessibility measures; estimates of disease rates; findings of disease clustering; spatial prediction and modeling of health outcomes; and estimates of individual exposures based on geographic proximity to pollutant and pathogen sources. It is now clear that positional errors can result in flawed...
متن کامل